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1. Introduction 



A fundamental problem in data structures is maintaining an ordered set S ofn items drawn from 
a universe U of size M ^ n. For a totally ordered U, the dictionary operations: insert; delete; test 
membership; and predecessor are all supported in O(logn) time and 0(n) space in the comparison 
model via balanced binary search trees. Here we consider the relaxed problem where U is partially 
ordered and give the first data structure for maintaining a dynamic partially ordered set drawn 
from a universe that can be described by a tree. 

As a motivating example, consider an email user that has stockpiled years of messages into a 
series of hierarchical folders. When searching for an old message, filing away a new message, or 
removing an impertinent message, the user must navigate the hierarchy. Suppose the goal is to 
minimize, in the worst-case, the number of folders the user must consider in order to find the 
correct location in which to retrieve, save, or delete the message. Unless the directory structure is 
completely balanced, an optimal search does not necessarily start at the top — it might be better 
to start farther down the hierarchy if the majority of messages lie in a sub-folder. If we model 
the hierarchy as a rooted, oriented tree and treat the question "is message x contained somewhere 
in folder as our comparison, then maintaing an optimal search strategy for the hierarchy is 
equivalent to maintaining a dynamic partially ordered set under insertions and deletions. 

Related Work. The problem of searching in trees and partial orders has recently received con- 
siderable attention. Motivating this research are practical problems in filesystem synchronization, 
software testing and information retrieval [Ij. However, all of this work is on the static version of 
the problem. In this case, the set S is fixed and a search tree for S does not support the insertion or 
deletion of elements. For example, when S is totally ordered, the optimal minimum- height solution 
is a standard binary search tree. In contrast to the totally ordered case, finding a minimum height 
static search tree for an arbitrary partial order is NP-hard [2j. Because of this, most recent work 
has focused on partial orders that can be described by rooted, oriented trees. These are called 
tree-like partial orders in the literature. For tree-like partial orders, one can find a minimum height 
search tree in linear time 0-0. In contrast, the weighted version of the tree-like problem (where 
the elements have weights and the goal is to minimize the average height of the search tree) is 
NP-hard [6J although there is a constant-factor approximation [7J. Most of these results operate in 
the edge query model which we review in Sec. [2j 

Daskalakis et al. have recently studied the problem of sorting partial orders [HI [9] and, in [9], 
ask for analogues of balanced binary search trees for dynamic partially ordered sets. We are the 
first to address this question. 

Rotations do not preserve partial orders. Traditional data structures for dynamic ordered 
sets {e.g., red black trees, AVL trees) appear to rely on the total order of the data. All these 
data structures use binary tree rotations as the fundamental operations; applied in an unrestricted 
manner, rotations require a totally ordered universe. For example, consider Figure [l] (ii) which 
gives an optimal search tree for the elements {A, . . . ,1} depicted in the partial order of Figure [l] 
(i). If we insert node J (colored grey) then we must add a new test (S, J) below {A,B) which 
creates the sub-optimal search tree depicted in Figure [l] (iii) . Using traditional rotations yields the 
search tree given in Figure [l](iv) which does not respect the partial order; the leaf marked C should 
appear under the right child of test (A^B). Figure [l] (v) denotes a correct optimal search for the 
set {A, . . . , J}. The key observation is that, if we imagine the leaves of a binary search tree for 
a total order partitioning the real line, rotations preserve the order of the leaves, but not any kind 
of subtree relations on them. As a consequence, blindly applying rotations to a search tree for the 
static problem does not yield a viable dynamic data structure. To sidestep this problem, we will, in 
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Figure 1. (i) A partially ordered set {A, B,C, D, E, F,G, H, I, J}. A downward 
path from node X to node Y implies X ^Y. Note that, for example, E ^ F and G 
and / are incomparable, (ii) An optimal search tree for the set {A, S, . . . , /}. For 
any query (X, Y) an answer of X means descend left and an answer of Y means 
descend right, (iii) After adding the element J, a standard search tree would add 
a new query (B, J) below (A, B) which creates an imbalance, (iv) The search tree 
after a rotation; the subtree highlighted in grey is not a correct search tree for the 
partial order (i). (v) An optimal search tree for the set {A, . . . , J}. 

essence, decompose the tree-like partial order into totally ordered chains and totally incomparable 
stars. 



Techniques and Contributions. We define the Line-Leaf Tree, the first data structure that 
supports the fundamental dictionary operations for a dynamic set S CU of n elements drawn from 
a universe equipped with a partial order ^ described by a rooted, oriented tree. 

Our dynamic data structure is based on a static construction algorithm that takes as input the 
Hasse diagram induced by ^ on S' and in 0{n) time and space produces a Line-Leaf Tree for S. 
The Hasse diagram Hs for S is the directed graph that has as its vertices the elements of S and a 
directed edge from x to y if and only if x ^ y and no z exists such that x ^ z ^ y. We build the 
Line-Leaf Tree inductively via a natural contraction process which starts with Hs and, ignoring 
the edge orientations, repeatedly performs the following two steps until there is a single node: 

(1) Contract paths of degree-two nodes into balanced binary search trees (which we can binary 
search efficiently); and 

(2) Contract leaves into linear search structures associated with their parents (which are natural 
search structures since the children of an interior node are mutually incomparable). 

One of these steps always applies in our setting since Hs is a rooted, oriented tree. We give an 
example of each step of the construction in Figure [2j We show that the contraction process yields a 
search tree that is provably within an 0(logit;)-factor of the minimum-height search tree for S. The 
parameter w is the width of S — the size of the largest subset of mutually incomparable elements of 
S — which represents a natural obstacle when searching a partial order. Our construction algorithm 
and analysis appear in Section |3| 

The intuition behind the proof of the approximation ratio is that an optimal search tree for any 
minor of Hs gives a lower bound on an optimal search tree for Hs- Since optimal search trees are 
easy to describe for paths of degree-two nodes as well as for stars, the approximation ratio follows 
by bounding the number of rounds in the contraction process. We also show that our analysis is 
tight. 

To make the Line-Leaf Tree fully dynamic^ in Section [4] we give procedures to update it under 
insertions and deletions. All the operations, take 0{logw) OPT comparisons and RAM operations 
where OPT is the height of a minimum-height search tree for S. Additionally, insertion requires 
only 0{h) comparisons, where h is the height of the Line-Leaf Tree being updated. (The non- 
restructuring operations test membership and predecessor also require at most 0(h) comparisons 
since the Line-Leaf Tree is a search tree). Because it; is a property of 5, in the dynamic setting 
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Figure 2. Examples of (i) a line contraction where we build a balanced binary 
search tree from a path and (ii) a leaf contraction where we build a linear search 
tree from the leaves of a node. 

it changes under insertions and deletions. However, the Line-Leaf Tree maintains the 0{logw) • 
OPT height bound at all times. This means it is well-defined to speak of the 0{logw) • OPT upper 
bound without mentioning S. 

The insertion and deletion algorithms maintain the invariant that the updated Line- Leaf Tree 
is structurally equivalent to the one that we would have produced had the static construction 
algorithm been applied to the updated set S. In fact, the heart of insertion and deletion is correcting 
the contraction process to maintain this invariant. The key structural property of a Line-Leaf 
Tree — one that is not shared by constructions for optimal search trees in the static setting — is that 
its sub-structures essentially represent either paths or stars in allowing for updates that make 
only local changes to each component search structure. The 0(logi(;)-factor is the price we pay 
for the additional flexibility. The dynamic operations, while conceptually simple, are surprisingly 
delicate. As with many data structures, our proofs perform a case analysis which mimics the 
underlying algorithmic definitions of Insert and Delete respectively. 

In Section [5] we provide empirical results on both random and real- world data that show the 
Line-Leaf Tree is strongly competitive with the static optimal search tree. 

2. Models and Definitions 

Let be a finite set of M elements and let ^ be a partial order, so the pair (ZY, ^) forms a 
partially ordered set. We assume the answers to ^-queries are provided by an oracle. (Daskalakis, 
et al. ^ provide a space-efficient data structure to answer ^-queries in 0(1) time.) 

In keeping with previous work, we say that U is tree-like if Hu forms a rooted, oriented tree. 
Throughout the rest of this paper, we assume that U is tree-like and refer to the vertices of Hu 
and the elements of U interchangeably. For convenience, we add a dummy minimal element u to 
U. Since any search tree for a set S embeds with one extra comparison into a corresponding 
search tree for S U {z/}, we assume from now on that u is always present in S. 

Given these assumptions it is easy to see that tree-like partial orders have the following properties: 

Property 1. Any subset S of a tree-like partially ordered universe U is also tree-like. 

Property 2. Every non-root element in a tree-like partially ordered set S C U has exactly one 
predecessor in Hs- 

Let Ts be the undirected (but still rooted and oriented) Hasse diagram for S. 
We extend edge queries to dynamic edge queries by allowing queries on arbitrary pairs of nodes 
in Ts instead of just edges in Ts- 

Definition 3 (Dynamic Edge-queries). Let u be an element in U and x and y be nodes in Ts. 
Let S' — S {u} and consider the edges (x, x^) and (y, y') bookending the unique path from x to 
y in Ts'- Define T^,, r|/ and T™^^ to be the three connected components of Ts' \ {(x, x')^ (y, y')} 
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Figure 3. Given two nodes x and y in S and a third node u a, dynamic edge 
query on (x, y) with respect to u can answer (i) Y, in which case u fahs somewhere 
in the shaded area labehed Y; (ii) X, in which case u falls somewhere in the shaded 
area labelled X; or (iii) HERE, in which case u falls somewhere in the shaded area 
labelled HERE. Notice that if (x, y) forms an actual edge then the query reduces to 
a standard edge query 



containing x, and neither x nor respectively. A dynamic edge query on (x, y) with respect to 
u has one of the following three answers: 

(1) X: if G T^/ {u equals or is closer to x) 

(2) Y: if G T|/ {u equals or is closer to y) 

(3) HERE: if G T™^^ {u falls between, but is not equal to either, x or y) 

Figure [3] gives an example of a dynamic edge query. Any dynamic edge query can be simulated by 
0(1) standard comparisons when Hs is tree-like. This is not the case for more general orientations 
of Hs and an additional data structure is required to implement either our algorithms or algorithms 
of [HI Sj. Thus, for a tree-like 5, the height of an optimal search tree in the dynamic edge query 
model and the height of an optimal decision tree for S in the comparison model are always within 
a small constant factor of each other. For the rest of the paper, we will often drop dynamic and 
refer to dynamic edge queries simply as edge queries. 

3. Line-Leaf Tree Construction and Analysis 

We build a Line- Leaf Tree T inductively via a contraction process on Ts. Each contraction 
step builds a component search structure of the Line-Leaf Tree. These component search struc- 
tures are either linear search trees or balanced binary search trees. A linear search tree LST{x) is 
a sequence of dynamic edge queries, all of the form (x, y) where y ^ that ends with the node x. 
A balanced binary search tree BST{x^ y) for a path of contiguous degree-2 nodes between, but not 
including, x and y is a tree that binary searches the path using edge queries. 

Let To — Ts- If the contraction process takes m iterations total, then the final result is a 
single node which we label T — T2m- In general, let T2i-i be the partial order tree after the line 
contraction of iteration i and T2i be the partial order tree after the leaf contraction of iteration i 
where z > 1. We now show how to construct a Line-Leaf Tree for a fixed tree-like set S. 

Base Cases: Associate an empty balanced binary search tree BST{x, y) with every actual 
edge {x,y) in Tq. Associate a linear search tree LST(x) with every node x in Tq. Initially, 
LST{x) contains just the node itself. 

Line Contraction: Consider the line contraction step of iteration z > 1: If X2, . . . ,xt-i is a 
path of contiguous degree-2 nodes in 72(^_i) bounded on each side by non-degree-2 nodes 
xi and Xt respectively, we contract this path into a balanced binary search tree BST{xi^ xt) 
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Figure 4. The set S under several iterations of the construction algorithm. 



over the nodes X2, . . . ^Xt-i. The result of the path contraction is an edge labeled (xi^xt). 
This edge yields a dynamic edge query. 
Leaf Contraction: Consider the leaf contraction step of iteration i > 1\ U yi, . . . ,yt are all 
degree-1 nodes in T2i-i adjacent to a node x in T2i-i, we contract them into the linear 
search tree LST(x) associated with x. Each node yj contracted into x adds a dynamic edge 
query (x^yj) to LST{x). If nodes were already contracted into LST(x) from a previous 
iteration, we add the new edge queries to the front (top) of the LST. 

After m iterations we are left with T = which is a single node. This node is the root of the 
Line-Leaf Tree. 

3.1. Example Construction. Here we provide an example Line-Leaf Tree construction for a 
partial order on a set S with 23 elements. Figure |4] shows Ts after each step of each round of the 
contraction process. Figure [5] shows the final Line-Leaf Tree. 

Suppose S has the tree structure illustrated in Figure |4] (i). We associate an empty balanced 
binary search tree (BST) with every edge in Tq = Ts and a linear search tree (LST) comprised of 
only the node itself with every node in Tq. The first path contraction creates BSTs for the chains 
J,K}, {P}, {S,T}, {W}, and associates them with the edges (F,L), (F,i?), {R,V), 
(i?, X), respectively. We obtain the tree in Figure [4] (ii) . 

The first iteration ends with a leaf contraction step that adds collections of leaves {A, 5}, E}^ 
{TV}, {V,X}, {y, Z} to the LSTs of elements C, F, L, R, M, respectively. This yields the tree in 
Figure [i] (iii). 

At this point, the next path contraction creates a BST for the single-element chain {L} and 
associates this BST with edge (F^M). Finally, the second leaf contraction reduces the tree to a 
single node by contracting the final leaves C, M, R into the LST of node F as shown in Figure [4] 
(v). This ends the construction process. 
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Figure 5. Line-Leaf Tree for the set S given in Figure |4j The empty nodes 
indicate impossible answers. 

Notice that in Figure [5] some answers to edge queries are left empty. We call these impossible 
answers. This happens because the answer HERE to an edge query (x, y) implies that the node u 
we seek is not equal to either x or but rather lies between them. However, if there is at least one 
node on the path between x and we need to ask the questions of the edges adjacent to nodes 
X and y on that path in order to determine whether u should be placed between two elements. 
Such a question cannot answer x or y since the HERE answer eliminated this possibility. Thus these 
choices are impossible. 

3.2. Searching a Line-Leaf Tree. Searching a Line-Leaf Tree for an element u is tantamount 
to searching the component search structures. A search begins with LST{x) where x is the root 
of T. Searching LST(x) with respect to u serially questions the edge queries in the sequence. 
Starting with the first edge query, if (x, y) answers X then we move onto the next query (x, z) in 
the sequence. If the query answers HERE then we proceed by searching for u in BST{x^y). If it 
answers Y, then we proceed by searching for u in LST{y). If there are no more edge queries left 
in LST{x), then we return the actual element x. When searching BST{x,y), if we ever receive a 
HERE response to the edge query (a, 6), we proceed by searching for u in BST(a^b). That is, we 
leave the current BST and search in a new BST. If the binary search concludes with a node x, then 
we proceed by searching LST{x). Searching an empty BST returns Nil. 

3.3. Implementation Details. The Line-Leaf Tree is an index into Hs but not a replacement 
for Hs- That is, we maintain a separate DAG data structure for Hs across insertions and deletions 
into S. This allows us, for example, to easily identify the predecessor and successors of a node x ^ S 
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once we've used the Line-Leaf Tree to find x in Hs. The edges of Hs also play an essential 
role in the implementation of the Line-Leaf Tree. Namely, an edge query (x, y) is actually two 
pointers: Ai(x, y) which points to the edge (x, a) and A2(x, y) which points to the edge (6, y). Here 
(x, a) and (6, y) are the actual edges bookending the undirected path between x and y in Ts- This 
allows us to take an actual edge (x, a) in memory, rename x to and indirectly update all edge 
queries (x, z) to (w^z) in constant time. Here the path from z to x runs through a. Note that 
we are not touching the pointers involved in each edge query (x, z), but rather, the actual edge in 
memory to which the edge query is pointing. 

Edge queries are created through line contractions so when we create the binary search tree 
BST(x^ y) for the path x^a^ . . . ^b^y^ we let Ai(x, y) = Ai(x, a) and A2(x, y) = A2(6, y). We assume 
that every edge query (x,7/) corresponding to an actual edge (x^y^) has Ai(x,^) = A2(x,7/) = 

{x',y'). 

3.4. Node Properties. We associate two properties with each node in S. The round of a node x 
is the iteration i where x was contracted into either an LST or a EST. We say ROUNd(x) = i. The 
type of a node represents the step where the node was contracted. If node x was line contracted^ 
we say type(x) = line, otherwise we say type(x) = leaf. 

In addition to ROUND and TYPE, we assume that both the linear and binary search structures 
provide a PARENT method that operates in time proportional to the height of the respective data 
structure and yields either a node (in the case of a leaf contraction) or an edge query (in the case 
of a line contraction). More specifically, if node x is leaf contracted into LST (a) then PARENt(x) 
= a. If node x is line contracted into BST{a^h) then PARENt(x) = (a, 6). We emphasize that the 
PARENT operation here refers to the Line-Leaf Tree and not Ts. Cohectively, the round, type, 
and PARENT of a node help us recreate the contraction process when inserting or removing a node 
from S. 

3.5. Approximation Ratio. The following theorem gives the main properties of the static con- 
struction. 

Theorem 4. The worst-case height of a Line-Leaf Tree T built from a tree-like S is Q{logw) • 
OPT where w is the width of S and OPT is the height of an optimal search tree for S. In addition, 
given Hs, T can he built in 0{n) time and space. 

Proof. We begin with some lower bounds on OPT. 

Claim 5. OPT > max{A(S'), logn, logD, logit;} where A(S') is the maximum degree of a node in 
Ts, n is the size of S, D is the diameter of Ts and w is the width of S. 

Proof. Let x be a node of highest degree A{S) in Ts. Then, to find x in the Ts we require at least 
A{S) queries, one for each edge adjacent to x [10]. This imphes OPT > A{S). Also, since querying 
any edge reduces the problem space left to search by at most a half, we have OPT > log n. Because 
n is an upper bound on both the width w of S and the diameter of Ts we obtain the final two 
lower bounds. □ 

Recall that the width w of S is the number of leaves in Ts- Each round in the contraction process 
reduces the number of remaining leaves by at least half: round i starts with a tree T2i on Ui nodes 
with Wi leaves. A line-contraction produces a tree r2i+i, still with Wi leaves. Because T2i^i is full, 
the number of nodes neighboring a leaf is at most Wi/2. Round i completes with a leaf contraction 
that removes all Wi leaves, producing T2i+2- As every leaf in T2i^2 corresponds to an internal node 
of T2i^i adjacent to a leaf, T2i+2 has at most Wi/2 leaves. It follows that the number of rounds is 
at most log It;. The length of any root-to-leaf path is bounded in terms of the number of rounds. 

Lemma 6. On any root-to-leaf path in the Line-Leaf Tree there is at most one EST and one 
LST for each iteration i of the construction algorithm. 
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Figure 6. A tight example of our analysis: starting with a single node (i) we grow 
the partial order tree (ii)-(iv) by adding nodes both horizontally and vertically. 
The contraction process of our Line- Leaf Tree construction algorithm reverses 
the process that grows the tree. 



Proof. On a root-to-leaf path, the Line-Leaf Tree contains LST and BST data structures in 
decreasing order of the iteration i since the data structure is built incrementally from the bottom 
up. Suppose we are currently in LST {a). The search structures immediately accessible from this 
point (aside from ourselves) are: 

• LST{h) for ah queries (a, h) G LST{a) 

• BST{a, c) for ah queries (a, c) G LST{a) 

If (a, 6) G LST{a), then type(6) = leaf and so ROUNd(6) < ROUND(a) by construction. If d is a 
node in BST{a, c), then ROUND((i) < ROUNd(c) < ROUND(a) since d was line contracted before c was 
leaf contracted into LST (a). Now suppose we are currently in BST{a,b). All nodes c contracted 
into this BST have equal ROUND j by construction. The next accessible search structures are: 

• BST{d, e) for ah edge queries (d, e) G BST{a, b) 

• LST{c) for each leaf c of BST{a, b) (this LST may consist of only node c) 

If / is a node in BST{d,e)^ then ROUNd(/) < j since / was line contracted before all nodes in 
BST {a, b) (otherwise, / would be in BST {a, b)). If c is a node in BST {a, b) then ROUND (c) = j. 

Finally, consider a root-to-leaf path. Suppose at some point we are in LST{a) and the next 
search structure we enter is LST{b). It follows from above arguments that ROUND(a) is strictly 
smaller than ROUNd(6). Suppose at some point we are in BST{c,d) and the next structure on 
the path is BST{e, f). Then for all nodes g line contracted into BST{c^d) and all nodes h line 
contracted into BST{e, f), we have ROUNd(^) strictly smaller than ROUNd(/i) and this concludes 
our proof. □ 

For each LST we perform at most A{S) queries. In each BST we ask at most 0{logD) questions. 
By the previous lemma, since we search at most one BST and one LST for each iteration i of the 
contraction process and since there at most log it; iterations, it follows that the height of the Line- 
Leaf Tree is bounded above by: (A(S') + 0{logD)) log it; = 0{logw) • OPT. 

Now we show that in the worst case, the height of T is at least ri(log w) • OPT. Consider growing 
a partial-order tree T both vertically and horizontally according the process depicted in Figure [6j 
call a node free if it has no edge moving in the vertical direction. If Figure [g] (i) depicts the tree 
after iteration 1, Figure [g] (ii) depicts the tree after iteration 2, and so on, then in iteration A; > 1 
we add {k + 1)2^~^ nodes to the tree from iteration k — 1 according the following rules: 

• add 2 children to each of the k2^~'^ free nodes in the vertical direction. 

• add 2^~^ new free nodes just to the left of the rightmost node in the horizontal direction 
(these new nodes collectively form the k^^ base tree). 

Thus, after k iterations there are N = Ej=i(i + 1)2^"^ = Q{k2^) nodes. Since c log TV - log log TV < 
k < logN for any k > 1 and constant c > 1 we have k is 6(logA^). Note also that the width w of 
T is Q(N). An optimal search tree for T uses k edge queries to narrow the search down to one of 
the A: + 1 base trees and then uses an additional 0(k) queries to binary search that base tree. This 
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binary search is possible because in a tree with constant maximum degree, there is always an edge 
that cuts the tree into pieces of size at least Thus OPT = Q{k). However, the contraction 
process on T that inductively defines the Line- Leaf Tree results in a sequence of minors that 
essentially reverses the process of growing T. For example, line- and leaf-contracting Figure |6] (iv) 
yields Figure [g] (iii). Thus, in the unfortunate case that the node we desire is the node just to the 
right of the rightmost node on the horizontal line (i.e. the black node in the figure), the Line-Leaf 
Tree must binary search the horizontal components of each of the k base trees. In other words, 
it must perform X^iLi 0(log(2^)) = 0(/c^) edge queries. Thus the height of the Line-Leaf Tree 
is at least within a factor of Q{k) = 6(log A^) = Q{logw) of the height of the optimal static search 
tree. 

We know prove the time and space bounds. Consider the line contraction step at iteration i: 
we traverse T2(^_i), labeling paths of contiguous degree-2 nodes and then traverse the tree again 
and form balanced BSTs over all the paths. Since constructing balanced BSTs is a linear time 
operation, we can perform a complete line contraction step in time proportional to the size of size 
of T2(^_i). Now consider the leaf contraction step at iteration i: We add each leaf in T2i-i to the 
LST corresponding to its remaining neighbor. This operation is also linear in the size of T2i-i. 
Since we know the size of T2i is halved after each iteration, starting with n nodes in Tq, the total 
number of operations performed is Yl^i^=o^ = 0{n). 

Given that the construction takes at most 0(n) time, the resulting data structure occupies at 
most 0{n) space. □ 

Theorem [4] assumes that the Hasse diagram for S is already constructed. If this is not the case, 
then one must sort the partial order, which, for general partial orders requires Q(n{\ogn + w)) 
comparisons [H [9]. We are unaware of any work that directly addresses the sorting question for 
tree-like partial orders. 

4. Operations 

4.1. Test Membership. To test whether an element A G U appears in T, we search for A in 
LST{x) where x is the root of T. The search ends when we reach a terminal node. The only 
terminal nodes in the Line- Leaf Tree are either leaves representing the elements of S or Nil 
(which are empty BSTs). So, if we find A in T then TEST MEMBERSHIP returns True, otherwise 
it returns False. Given that test membership follows a root-to-leaf path in T, the previous 
discussion constitutes a proof of the following theorem. 

Theorem 7. Test Membership takes 0{h) time. 

4.2. Predecessor. Property [l] guarantees that each node A ^ U has exactly one predecessor in 
S. Finding the predecessor oi Ain S is similar to TEST MEMBERSHIP. We search T until we find 
either A or Nil. Traditionally if A appears in a set then it is its own predecessor, so, in the first 
case we simply return A. In the latter case, A is not in T and Nil corresponds to an empty binary 
search tree BST{y, z) for the actual edge (?/, z) where, say, y ^ z. We know that A falls between 
y and z (and potentially between y and some other nodes) so y is the predecessor of A. We return 
y. Given that PREDECESSOR also follows a root-to-leaf path in T, the previous discussion yields a 
proof of the following theorem. 

Theorem 8. Predecessor takes 0{h) time. 

4.3. Insert. Let A be the node we wish to insert in T and let S' — Syj{A}. Our goal is transform T 
into T' where T' is the Line- Leaf Tree produced through the contraction process when started 
on Ts'. We divide INSERT into three corrective steps: local correction^ down correction^ and up 
correction which we describe below. Local correction repairs the contraction process for elements 
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that appear near A during the contraction process. Down correction repairs T for nodes with round 
at most ROUND (A). Up correction repairs T for nodes with round at least ROUND (A). 

We begin with some notation. Let X be a node such that LST{X) has t edge queries (X, Yi) . . . (X, Yf) 
sorted in descending order by ROUND(y^). In other words, Yi is the last node leaf- contracted into 
LST{X), Yf is the first node leaf- contracted into X and Yi is the (t — z + 1)^^ node contracted into 
LST{X). Define pi{X) = Yi and iii{X) = ROUND(yi). That is, pi{X) yields the (t - i + 1)^^ node 
contracted into LST{X) and //^(X) yields the ROUND of Pi{X). If i > t then pi{X) = 0. 

The following lemma relates the type of a node to the rounds of the nodes contracted into it. 

Lemma 9. Let B be a node in a Line-Leaf Tree such that round (^S^ = k. 

(1) // PARENT (^S^ = NULL then either iJii{B) = k- l>k-2 = l^2{B) = p^i^B) > jiA^B) or 
l^i{B) = i^2{B) = i^siB) = k-l> p4{B). 

(2) //type(S) = LEAF then pi{B) = /X2(S) = /c - 1 > i^siB). 

(3) //type(S) = LINE then pi{B) = k-l> P2{B). 

Proof. The proof follows from the contraction process. If PARENt(5) = NULL then S is a full node 
at iteration k — 1 and is the sole remaining node at iteration k, or B has degree 1 at iteration 
k — 1 and we arbitrarily made it root. If type(S) = LEAF, then B was not contracted at iteration 
k — 1, it was a full node. But B is leaf contracted at iteration A:, thus it has degree 1. Therefore, 
at least two nodes were leaf contracted into LST{B) at iteration k — 1. If type(5) = line, then 
at iteration k — 1, B was a full node. But B is line contracted at iteration A:, thus it has degree 2. 
Therefore, at least one node was leaf contracted into LST(B) at iteration k — 1. □ 

4.3.1. Local Correction. We start by finding the predecessor of A in Ts- Call this node B. We refer 
to B as the insertion point. A potentially falls between B and any number of its children{B). That 
is, A may replace B as the parent of a set of nodes D C children{B). We emphasize that the parent 
and child relationship here is over Hs and not the Line- Leaf Tree T. We use D to identify two 
other sets of nodes C and L. The set C represents nodes that, in Ts^ were leaf-contracted into B in 
the direction of some edge (S, Dj) where Dj ^ D. The set L represents nodes that were involved 
in the contraction process of B itself. Depending on type(5) the composition of L falls into one 
of the following two cases: 

(1) if type(S) = LINE then let PARENt(S) = {E,F). Let De and Dp be the two neighbors 
of B on the path from E to F. li De and De are in D then L = {E, F}. If only i?^ is in 
D, then L = {E}. If only De is in D, then L = {F}. Otherwise, L = 0. 

(2) If type(S) = LEAF then let PARENt(S) = E. Let De he the neighbor of B on the path 
B ...E. Let L = {E} if De is in D and let L = otherwise. 

We call nodes appearing in either C or L stolen nodes. 
Lemma 10. Identifying B, L and C takes at most 0(h) time. 

Proof. By Theorem [t] we can identify B in 0{h) time. Now we can use Hs to identify the successors 
of B which we can use to form D. Using a single parent operation (which is clearly bounded above 
by /i), we can find either LST{E) where PARENt(S) = £; or BST{E,F) where PARENt(S) = 
(E^F). We can use the pointers offered by, in the first case, the dynamic edge query {B^E) to 
identify De and, in the second case, the dynamic edge queries {B^E) and {B^F) to identify De 
and De- With these nodes in hand, we can easily form L by checking, in constant time, if, in 
the first case, De is in D and, in the second case, if De and De are in D. Now we analyze the 
formation of the set C. For each edge (S, Y) in LST{B)^ we use Ai(S, Y) = (S, Z) to identify the 
neighbor Z of B along the path B ...Y. U Z e D then add Z to C. Since the height of LST{B) 
is bounded above by /i, we have the desired result. 

□ 
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If C and L are both empty, then A appears as a leaf in Ts' and ROUND (A) = 1. In this case, we 
only need to correct T upward since the addition of A does not affect nodes contracted in earlier 
rounds, so we call Up Correct with A and B. However, if C or L is non-empty, then A is an 
interior node in T^/ and A essentially acts as B to the stolen nodes in C. Thus, for every edge query 
(B, Ci) where Ci G C, we remove (B, Ci) from LST{B) and insert it into LST(A). In addition, we 
create a new edge (B, A) and add it to Hs which yields Hs'- 

Lemma 11. The edge query removals from LST(B) and their insertion into LST(A) collectively 
take time proportional to the height of LST{B). 

Proof. We can traverse LST{B)^ remove the edge queries involving nodes in C, and insert them in 
LST{A) in time proportional to the height of LST{B) since LSTs are just linked lists. For each 
stolen edge query (S, Ci) we need only replace B with A in the actual edge Ai(B, Ci) = (S, X) so 
that it becomes {A^X). These pointer updates are bounded above by the height of LST{B)^ so 
the lemma follows. 

□ 



Lemma 10 and Lemma 11 imply the following corollary. 



Corollary 12. Local correction takes 0{h) time. 



Table 1. The Transition function which serves as a helper to insert. 





Updated Properties 


Data Structure Updates 


round(P) = M2(^) + 1 




ifMl(p) = 


= /i2(P) then Up Correct P at insertion point Q 





else let M = pi{P) 


CO 

< 




remove edge (P, M) from LST{P) 




PARENT (M) ^ Q 


BST(Q, M) ^ Down Correct BST(Q, P) and BST(P, M) 
create edge (Q, M) from BST(Q, M) and insert it into LST(Q) 



Local correction leaves us with candidate versions of LST(A) and LST(B) as well as a set of 
nodes L. The edges in LST{A) and LST{B) remain in their respective LSTs with one small 
exception: Stealing edge queries from B and inserting them into A may cause one oi A 01 B to no 
longer adhere to Lemma [9] and we may need to continue correcting the Line-Leaf Tree upward 
or downward. 

The INSERT procedure uses a helper function, Transition, to identify these situations and tran- 
sition into either Down Correct or Up Correct: given two nodes P and Q where round (P) < 
ROUNd((5), it determines if P was line contracted between pi{P) and Q at some earlier round. If 
this is the case, then the contraction process has been repaired except for node P which may be 
out of place on the line from pi{P) to Q so it calls Dov\^N Correct on (pi(P),P) and (P, Q) to 
finish the contraction process. Otherwise, we have patched the contraction process for all rounds 
up to ROUND (P) so we Up Correct P and Q to complete the repair. A formal description of 
Transition appears in Table [T] and a formal description of insert appears in Table [2| 

Theorem 13. Insert takes 0{h) time. 

Proof. The heart of our proof is showing that INSERT arrives at a scenario where Transition can 
be called. We show this by exhaustively examining how INSERT deals with all possible ROUND, 
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Table 2. The Insert algorithm: after locahy correcting A around its predecessor 
B, INSERT uses Transition to either Down Correct or Up Correct. Below, 
k denotes round(5) before insertion and BST{A^ B) is an empty BST for the new 
edge (A 5). 



insert(A, B) 


Apply Local Correction which yields candidate versions of LST(A) and LST(B) and a new edge (A, B). 


Updated Properties 


Data Structure Updates 


Case 1: type(S) = leaf and parent(S) = null 


if (1) =M2(S) = k 


- 1, or (2) Ml(-B) = k - 1 and M2(A) < A; - 2, or 


(3) Ail(S) = M2(S) = k - 2 and /X2(A) < k - 1 then 


if = M2(A) = i^i(B) = i^2(B) then 
round(S) round(S) + 1 


Transition (A, B) 


else 


round(A) ^ Ml (A) + 1 

TYPE (A) ^ LEAF 
PARENt(A) ^ NULL 


A becomes new root of the Line-Leaf Tree 
Transition (B,A) 


Case 2: type(S) = LEAF, parent(S) = E, and L = 


if Ail (A) = /Li2(A) = k — 1 then 


round(A) ^ Ml (A) + 1 


remove edge (B , E) from LST(E) 


TYPE (A) -h- LEAF 


BST(A, E) ^ Down Correct BST{A, B) and BST(B, E) 


parent(A) E 


insert edge (A, F) into LST{E) 


else Transition (A, 


Case 3: type(S) = LEAF, parent(S) = E, and L = {E} 


round(A) Ml (A) + 1 


replace B with A in BST(B, E) which becomes BST{A, E) 


if /X2(-B) < fc - 1 then 


TYPE(A) LEAF 

parent(A) E 


remove edge (B , E) from LST(E) 

create edge (A, F) and insert into LST(E) 

Transition (F, A) 


else BST(B, E) ^ 


Down Correct BST(B, A) and BST{A, F) 


Cases 4-5: type(B) = line and parent(B) 


= (E,F) 


Let (A^, S), {B, M) be edges adjacent to B in BST{E, F) 


in the directions of £^ and F respectively. W.l.o.g. A falls between {B , M) 


Case A. L = {E} or L = {F} 


round(A) ^ Ml (A) + 1 
round(S) ^ Mi(-B) + 1 
type(A) line 


remove edge (F, M) from BST{E,F) 


replace B with A in BST{B, M) which becomes F5T(A, M) 


if Ml(-B) > Ml (A) then 




BST(B,M) ^ Down Correct FST(F,A) and BST{A,M) 




insert edge (F, M) (with BST(B, M)) back in BST{E, F) 


else if Ml(-B) = Ml (A) then 


parent(A) (£;, F) 


insert edges (F, A) and (A, M) into BST(E, F) 


else Ml(^) < Ml(A) 




insert edge (A, M) into BST(E, F) 


parent(A) (£;, F) 


remove edge (AT, F) from BST(E, F) 

BST(N, A) ^ Down Correct BST(N, B) and BST(B, A) 
create edge (AT, A) (with BST(N, A)) and insert into BST(E, F) 


Case 5: L = {F, F} or L = 


if L = {F, F} then 


type(A) ^ line 
round(A) ^ round(S) 
parent(A) (F, F) 


replace F with A in BST{E, F) 
Transition (F, A) 


else Z/ = 0: Transition (A, S) 
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Case 


type(B) 


parent(B) 


L 


1 




NULL 


L = 


2 


LEAF 


E 


3 




L = {E} 


4 


LINE 




L = {E} OT L = {F} 


5 


L = or L = 



Table 3. An exhaustive list of cases for insert. 



TYPE, and PARENT values of B as well as the contents of L after executing local correction. To help 
this verification, we summarize the list of cases in Table [sj In all the cases below ROUNd(S) = k. 

Case 1: parent (S) = NULL: From Lemma |9] we know that before insertion, either = 
IJi2{B) = IJ^s{B) = — 1 or iJii{B) = k — l>k — 2 = l^2{B) = fis{B). Consider the first case 
where, before insertion, = l^2{B) = l^?,{B) = — 1. We have two possibilities after 

inserting A. 

(a) Suppose that after insertion, iJii{B) — /J^2{B) = fc — 1. From Lemmajoj this implies that 
max{/X3(S), /xi(74)} = A: — 1 and B has degree at least 3 at the beginning of iteration 
k — 1. Similarly, if = M2(^) = — 1, then A also has degree 3 at the beginning of 
iteration k — 1. After iteration A: — 1, either B survives alone, or B and A each survive 
with degree 1. We keep B as the root of the Line-Leaf Tree; if A survives together 
with 5, we increment ROUND (S) and this completes the correction. 

(b) Suppose that after insertion, iJi2{B) < < A: — 1. This implies that iJ^i{A) = 
IJ^2{A) = A: — 1. Thus, analogous to above, A survives alone after iteration k — 1. A 
becomes the new root of the Line-Leaf Tree. We can now apply the Transition 
function with P — B and Q = ^ to correct B on the path between A and pi{B). This 
completes the correction. 

Now, suppose that before insertion iJii{B) — k — l>k — 2 — iJi2{B) — /J^siB). 

(a) Suppose that after insertion /J^i{B) = A: — 1 where M = pi{B). This implies k — 2 > 
Pi(A) > ii2{A). Thus, after k — 2 iterations, T^-/ is either a line with endpoints M and 
B {A was contracted earlier), or a line with endpoints M and A {B may be on the 
chain of degree 2 nodes connecting M and A or may have been contracted earlier). If 
lJii{A) — P2{A) = A: — 2, then A survives until iteration k — 1. Even if B survives as 
well {ii2{B) = A: — 2), it is line contracted into BST{A, M). Thus, w.l.o.g. A becomes 
the new root node. We then apply Transition with P = B and Q = ^ to correct the 
path between A and pi{B). This completes the correction. If iJi2{A) < A: — 2, then A 
does not survive until iteration A: — 1. This means that P2(B) — k — 2 and B survives 
together with M. Without any loss of generality, we keep B as root. We then apply 
Transition with P ^ A and Q ^ B io correct A on the path between B and pi{A). 
This completes the correction. 

(b) Suppose that after insertion pi{A) — k — 1. The situation is symmetric to case (a) 
above: if pi(B) = P2{B) — k — 2^ then B survives until iteration A: — 1. Even if 
A survives as well {p2{A) = ^ — 2), it is line contracted into BST{B^M). Thus, 
w.l.o.g. B stays the root node. We then apply Transition with P = A and Q = B. 
If P2{B) < A: — 2, then B does not survive until iteration A; — L This means that 
/i2(A) — k — 2 and A survives together with M. Here we make A the new root. We 
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then apply TRANSITION with P = B and Q = A to repair the path from pi{B) to A. 
This completes the correction. 

To review, if either (1) = ii2{B) = k - 1, or (2) iii{B) = k-l and < k-2, 

or (3) iJii{B) = /i2(^) = — 2 and /i2(^) < — 1, then B remains the root of the Line- 
Leaf Tree after insertion. We then apply Transition with P = A relative to Q = B. 
Otherwise, A becomes the new root and we Transition with P = B and Q = A. 

For cases 2-3, type(S) = leaf and parent(5) = E. Before insertion, after iteration 
k — 1^ B had degree 1 and was connected to E through a (possibly empty) chain of degree 
2 nodes. The chain was line contracted into BST[B^E) and B was leaf contracted into E. 

Case 2: L = 0: After insertion, if iii{A) — /i2(^) = — 1, then after iteration k — 1^ A 
has degree 1 and is connected to E through a (possibly empty) chain of degree 2 nodes 
that may contain B. Thus, the edge query (A^E) replaces edge (B^E) in LST{E). Since 
round(S) = lJii[B) + 1 < fc, S is line contracted between A and E so we Dow^N Correct 
with respect to {A,B) and {B,E) to determine BST{A,E). Otherwise, if /i2(^) <k - I, 
then A is contracted before iteration k and T is identical to T' beginning with round k. 
We keep edge {B,E) in LST{E) and apply the Transition algorithm with P = A and 
Q = B which completes the correction. 

Case 3: L = {E}: There are two subcases. First, if fii{B) = l^2{B) = — 1 after insertion, 
then after iteration k — l^B has degree 1 and is connected to E through a (possibly empty) 
chain of degree 2 nodes that may contain A. We know {B,E) remains an edge query in 
LST{E)^ but it now needs to accommodate the addition of A since ROUND (A) = fii{A) + l < 
k Thus, we remove the edge query (5, E) from LST{E) and, as a preliminary step, replace 
B with A in BST{B,E) to produce BST{A,E). Then we Down Correct with respect 
to {A,B) and {A,E) to determine the new BST{B,E). Finally, we insert {B,E) back into 
LST{E). 

Second, if after insertion ii2{B) < — 1, then B is contracted before iteration k. After 
iteration k — 1^ A has degree 1 and is connected to E through a (possibly empty) chain of 
degree 2 nodes. Thus, we remove {B,E) from LST{E), replace B with A in BST{B,E) 
to yield BST[A^E)^ and insert the new edge query {A^E). Now we're in a position to 
apply the TRANSITION function with P = B and Q = A after which we've repaired the 
contraction process. 

For cases 4-5, type(B) = line and parent(S) = (E^F). Before insertion, after iter- 
ation k — 1^ B had degree 2 and was part of a chain of degree 2 nodes connecting E and 
F. {E^ F) became the parent of B and the chain together with B was line contracted 
into BST{E, F). Let (S, N) and (5, M) be the edges representing B in BST{E, F), where 
w.l.o.g. A falls between [B^M). 

Case 4: L = {E} or L = {F}: After insertion, ROUND(fi) = /ii(S) + 1 and ROUNd(74) = 
/ii(A) + 1, since each of the two nodes is line contracted right after it has accumulated 
all of its leaves. We examine what happens to the Line-Leaf Tree after iteration k — 1. 
If iJii{B) > /ii(A), then the current Line-Leaf Tree looks identical to the one prior to 
insertion since A is line contracted at a prior iteration somewhere between B and M. If 
lii{B) = /ii(A), then both A and B have degree 2 and are part of a chain of degree 2 
nodes connecting E and F. A and B are line contracted together into BST{E^F). If 
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< then A has replaced B in the chain of degree 2 nodes connecting E and F. 

A is hne contracted into BST{E, F). 

In the data structure, we always replace B with A in BST{B, M) to create BST{A, M). 
If > we Down Correct BST{B, A) and SST(A, M) which yields a correct 

version of BST{B,M) which we insert back into BST{E,F). If /j.i{B) = we insert 

edges {A,B) and (A, M) into BST{E,F). If < we repair the line between 

A and so that B settles in its proper place. We remove BST{N, B) from BST{E, F) 
and Down Correct BST{N, B) and BST{B, A) to produce BST{A, N) which we insert 
back into BST{E^F). This concludes the contraction process. 
Case 5: L = {E^F} or L = 0: If L = then after iteration A: — 1, A is connected 

to E and F by chains of degree 2 nodes (identical to the ones for B pre-insertion). If 
lJii{B) — iJi2{B) — k — 1^ then B also survives as a neighbor of A. Thus, S is a leaf at 
some point in the contraction process so we Up Correct B at insertion point A (this 
happens via the call to Transition). If B does not survive, then A is line contracted into 
BST{E, F) analogously to how B was pre-insertion. In the data structure, we replace B 
with A in BST{M, B) and BST{N, B) and call Transition to potentially repair the path 
from pi{B) to A. This completes the correction. 



If L = 0, then after iteration — 1, S is connected to E and F by chains of degree 2 nodes 
(identical to the ones pre-insertion). If /ii(A) = M2(^) = — 1, then A also survives as a neighbor 
of B. Thus, A is a leaf at some point in the contraction process so we Up Correct A at insertion 
point B. If A does not survive, then B is line contracted into BST(E^ F) analogous to pre-insertion. 
In the data structure, we need only worry about correcting the path from pi{A) to B which is done 
via Transition. This completes the correction. 

What's left to show is that INSERT runs in time proportional to the height of the Line-Leaf 
Tree. From Corollary [12] local correction operates in 0{h) time. Furthermore, each case of insert 
performs at most 0(1) BST, LST, Down Correct, and Up Correct operations - each of which 



takes at most 0{h) time (See Lemma 14 for Down Correct and Lemma 15 for Up Correct). □ 



4.3.2. Down Correction. Down correction repairs the contraction process along a path in the par- 
tial order tree. More specifically, down correction takes two binary search trees BST{E, B) and 
BST{B^F) where ROUNd(5) does not respect the contraction process and returns a third search 
tree BST{E, F) where B has been floated down to the BST created in same round as B. In all cases, 
we know the edge (£", F) appears at some point in the contraction process. We assume that if X is a 
node on the path from E to F then both ROUNd(X) and LST{X) are well-formed and correct. This 
includes includes B — it is simply out of place structurally with respect to the contraction process. 
Moreover, and without loss of generality, we assume that round(S) < round(£^) < round(F) 
and that if round(£^) = round(5) then type(£^) = leaf. Down correction is used as a subroutine 
by both insertion and deletion. A formal description of the algorithm is given in Table [4j 

Lemma 14. Let BST(E, B) and BST{B^F) he binary search trees along a path from E to B 
to F where LST{X) and ROUNd(X) are correct for every node X on the path from E to F . 
Furthermore let round(S) < round(£;) < round(F) and round(£;) = round(S) only when 
type(£') = LEAF. If B does not respect the contraction process with respect to the path from E to 
F then Down Correct returns BST{E, F) in 0{h) time where B occupies the correct position. 

Proof. We begin by showing that Down Correct successfully repairs the contraction process. 
Suppose BST(E^ B) was created in round m where m = if BST{E^ B) is empty and suppose 
BST{B, F) was created in round n where n = if BST{B^ F) is empty. The proof is by structural 
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Table 4. Down Correct: Given BST{E,B) which was created in round m 
(m = if BST{E^ B) is empty) and BST(B, F) which was created in round n 
(n = if BST{B^F) is empty), return BST{E^F) where B appears in the correct 
position in BST{E,F). 



Down Correct {BST{E, B), BST{B, F)) 


Updated Properties 


Data Structure Updates 


Cases 1—5 


TYPE(B) = LINE 




Case 1: m < ROUNd(S) = n 


parent(B) = {E, F) 


insert {E, B) into BST(B, F) which becomes BST(E, F) 
return BST(E, F) 


Case 2: m,n < round(B) 


parent(S) = (E, F) 


insert edges {E, B) and (B, F) into a new (empty) BST{E, F) 
return BST(E, F) 


Case S: m = round(B) = n 


parent(B) = (E, F) 


merge BST(E, B) and BST{B, F) into BST(E, F) 
return BST(E, F) 


Case 4: m < round(B) < n 


Let {B, N) be the edge queries representing B in BST{B, F), where N ^ F 


PARENT(Ar) ^ {E, F) 


remove edge (B, AT) from BST{B, F) which becomes becomes BST{N, F) 
BST(E, N) ^ Down Correct BST(E, B) and BST(B, N) 
create edge (F, AT) with BST{E, N) 

insert edge (F, iV) into BST{N, F) which becomes BST{E, F) 
return BST(E, F) 


Case 5: m,n > round(S) 


Let (M, B) and {B, N) be the edge queries 
representing B in BST(E,B), BST(B,F), respectively. 


parent(M) ^ (E, F) 


remove edge (M, B) from BST{E, B) which becomes BST(E, M) 
remove edge (S, iV) from BST(B, F) which becomes BST{N, F) 
BST(M, N) ^ Down Correct BST(M, B) and BST(B, N) 


if m > n (w.l.o.g) then 


PARENT(iV") -h- (M, F) 


create edge (M, AT) with BST{M, N) 

insert edge (M, iV) into BST(N, F) which becomes BST(M, F) 
insert edge (M, F) into BST{E, M) which becomes BST{E, F) 
return BST(E, F) 


else m = n 


PARENT(Ar) ^ (£;, F) 


merge BST(E, M), BST(M, iV), and BST(N, F) into BST{E, F) 
return BST(E, F) 



induction. There are 5 cases. We begin with the base cases where both m and n do not exceed 
round(S). 

Case 1: m < round(fi) = n: is hne contracted after nodes in BST{E^B)^ but together 
with all the nodes in BST{B, F). We add edge query {E, B) to BST{B, F) and the result 
becomes the new BST{E, F). 

Case 2: m^n < round(fi): B is the only node line contracted at ROUNd(B) between E and 
F. This happens after all other nodes in BST{E, B) and BST{B, F) were already hne 
contracted. We create a new BST[E^F) and populate it with edge queries (E^B) and 
iB,F). 
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Case 3: m = round(S) = n: B is line contracted together with ah nodes in BST{E, B) and 
ah nodes in BST{B,F). We merge BST{B,E) and BST{B,F) and the result becomes 
the new BST{E,F). 

Case 4: m < round(S) < n: B is line contracted either after or at the same time as nodes 
in BST{E,B), but before any of the nodes in BST{B,F). Let {B,N) be the edge query 
representing B in BST{B, F), where N cannot be F since BST{B^ F) is not empty. Then, 
we know that B is line contracted somewhere between E and at a previous iteration. 
Remove (B, N) from BST{B^ F) (which becomes BST{N, F)). Recursively down correcting 
on BST{E, B) and BST{B, N) yields a correct BST{E, N) which we insert into BST{N, F) 
to produce BST{E,F). 

Case 5: round(S) < n,m: B is line contracted before any of the nodes in either BST{B, E) 
or BST{B,F). Let (S,M) and {B,N) be the edges representing B in BST{E,B) and 
BST{B^ F), respectively where M cannot be E and N cannot be F, since the two BSTs 
are not empty. We remove (M, B) and (S, TV) from BST{E, B) and BST{B, F), obtaining 
BST{E, M), BST{N, F). Then, we know that S is contracted somewhere between M and 
at a previous iteration: inductively down correcting BST{M^ B) and BST{B^ N) yields 
BST{M^B). If w.l.o.g. m > we proceed similarly to Case 4: we insert (M^N) into 
BStIn.F) (creating BST{M,F)) and then insert (M, F) to BST{E,M). The result is 
the new BST(E^ F). Otherwise, if m = n, we proceed similarly to Case 3: we merge 
BST{E, M), BST{M, TV), and BST{N, F) to create the new BST{E, F). 

What's left to show is that Down Correct operates in at most 0{h) time. Each recursive 
call (in cases 4 and 5) operates on, minimally, BST{B, N) where (B^N) is the edge bordering 
some path from B to F that was line contracted into BST[B^F) at iteration n. From Lemma [oj 
BST{B^N) was created at a previous iteration, so the algorithm halts after visiting at most / 
BSTs. Because (S, N) is always a bordering edge, the sum of the heights of the / BSTs is bounded 
above by 0{h). Since we perform at most 0(1) BST operations on each of the / BSTs, and these 
operations run in time proportional to the height, we have the desired bound. □ 

4.3.3. Up Correction. Suppose we are inserting A into T at insertion point B where ROUNd(74) < 
ROUND (S) and ROUND (A) and LST{A) are correct with respect to T^ Suppose further that we 
know type(74) = leaf and that (A, B) appears as an edge during some iteration of the contraction 
process (initially, A and B are neighbors in Hs). The addition of A may change the contraction 
process with respect to B (and these changes may propagate to later iterations of the contraction 
process). Up correction repairs the contraction process in this situation. A formal description of 
the Up Correct algorithm is given in Table [5j 

Lemma 15. Up Correct repairs the contraction process in 0(h) time so that T — T' . 

Proof. We begin by showing that Up Correct correctly repairs the contraction process. The 
proof is by structural induction. Let A be the node we are inserting at insertion point B where 
ROUNd(74) < ROUNd(5) = k. We distinguish six cases which are detailed below. 

Case 1: : round(A) < round(S), parent(S) = NULL, and fi2{B) < jii^B) = round(A) = 
k — 1 Since PARENt(S) = null, B is the root of T. By Lemma [9| B have either (i) 
/ii(S) = fc-l>A:-2 = ^2{B) = /X3(S) or (ii) ^i{B) = ^2{B) = ^^{B) ^ k - I. This 
case handles the scenario that inserting A into LST{B) (which is the default action when 
round(A) < round(S)) leads to /xi(S) = /i2(S) = fc-l>A:-2 = /i3(S) and B ceases 
to be the root. 

Before insertion, at the beginning of iteration /c — 1, the partial order tree consisted of 
nodes B and pi{B) = M connected by a (possibly empty) chain of degree-2 nodes. We 
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Table 5. Up Correct: insert node A at insertion point B where round(A) < 
ROUNd(5) and LST{A) is fully-formed and correct with respect to T^ 



Up Correct {A,B) 


Updated Properties 


Data Structure Updates 


Cases 1—6 


TYPE(A) ^ LEAF 




Case 1: round(A) < round(B), parent(B) = null, and iJi2{B) < ijli{B) = round(A) = k- l 


Let pi{B) = M 


PARENT(M) ^ NULL 


remove edge {B, M) from LST{B) 


PARENT (A) ^ M 


BST{A,M) ^ Down Correct BST{A,B) and BST{B,M) 


round(B) ^ round(B) — 1 


create edge (A, M) from BST{A, M) and insert it into LST(M) 


round(M) ^ round(M) + 1 


M becomes new root of the Line-Leaf Tree 


Case 2: round(A) < round(5) (and Case 1 does not apply) 


parent(A) ^ B 


insert edge (A, B) into LST(B) 


Case 3: round(A) = round(B), type(B) = leaf, and parent(B) = E 




remove edge (5, E) from LST{E) 


PARENT (A) ^ E 


BST{A,E) ^ Down Correct BST{A,B) and BST{B,E) 
create edge (A, E) from BST{A, E) and insert it into LST(E) 


Cases 4-6: round(A) = round(B), type(S) = line, and parent(B) = (E,F) 


parent(A) ^ B 


insert edge (A, B) into LST{B) 


round(B) ^ round(B) + 1 


spht BST{E, F) into BST{E, B) and BST{B, F) 


Case 4: ROUND (5) + 1 < ROUND (£;), ROUND (F) or 


round(B) + 1 


= round (F) < ROUND (F) and type(F) = leaf 




create new BST(E, F) 




insert edges (F, B) and (B, F) into BST(E, F) 


Case 5: round(B) + 1 = round(£J) and type(£J) = line 


Let parent(£;) = (G,H), where H may be F 


parent(B) ^ (G, H) 


remove edge (F, F) from BST{G, H) 

insert edges (F, B) and {B, F) into BST{G, H) 


Case 6: round(F) < round(B) + 1 < round(F) 


type(F) must be leaf and parent (F) = F 


type(B) ^ leaf 


remove edge (F, F) from LST(F) 


PARENT (F) ^ B 


insert edge (B, F) into LST(B) 


if parent(F) = NULL and round(F) = round(5) + 1 then 


PARENT(5) ^ NULL 

parent (f) ^ b 
round(F) ^ round(F) -1 


insert edge (B, F) into LST{B) 

B becomes new root of the Line-Leaf Tree 


else Up Correct B at insertion point F 
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line contracted the chain into BST{B^M) and installed B as the root node. During the 
insertion procedure so far, M was neither stolen from 5, nor removed from LST{B)] M 
survives after iteration k — 2. Since round(A) = k — 1^ A also survives. Thus, at the 
beginning of iteration k — 1 the poset consists of nodes M and A connected by a chain of 
degree-2 nodes containing B. Now, we should line contract this chain into BST{A^ M). We 
leaf-contract A into M and arbitrarily assign M to be the root. 

To repair the data structure, we replace B with M as the root node (we decrease 
round(5) to A: — 1 and we increase round(M) to k). We also remove edge {B,M) from 
LST{B) and we insert a newly created edge (M, A) into LST{M). To construct the new 
BST(A^ M) we appeal to down correction: the contraction process is correct along the path 
from A to M — only ROUND (5) has changed. 

Case 2: round(A) < round(B) and Case 1 does not apply: Node A is leaf contracted 
into LST{B) before iteration k and does not change T beyond this iteration. In the data 
structure, we insert {A^B) into LST{B). 

Case 3: round(74) = round(S), type(S) = leaf, and parent(B) = E: Before insertion, at 
the beginning of iteration A:, B had degree 1 and was connected to E through a (possibly 
empty) chain of degree-2 nodes. The chain was line contracted into BST{B^E) and B was 
leaf contracted into LST{E). After insertion, B has degree 2 with neighbors A on one side 
and the chain ending with E on the other. The chain, together with is line contracted 
into BST(A^ E) and A should be subsequently leaf contracted into LST{E) instead of B. 
To repair the data structure, we replace edge (B^E) with edge (A^E) in LST{E) where 
BST{A,E) comes from down correcting BST{A,B) and BST{B,E). 

For all subsequent cases (4-6), ROUNd(A) = round(5) and type(S) = line. Prior to insertion, 
at the beginning of iteration /c, B had degree 2 and was part of a chain of degree 2 nodes connecting 
E and F. The chain was line contracted into BST{E, F). After insertion, B has degree 3 with the 
chain accounting for 2 and A accounting for 1. The two sides of the chain are now line contracted, 
independently, into BST{E^B) and BST{B^F)^ respectively. A is subsequently leaf contracted 
into LST{B). Thus, B survives an extra iteration and we examine the fate of 5, E^ and F in the 
cases below. 

To repair the data structure, we split the former BST{E,F) into BST{E,B) and BST{B,F) 
which we associate with edges {E^B) and (B^F)^ respectively. We also insert edge (A^B) into 
LST{B). Since B survives one extra iteration, ROUND (S) increases by 1 after Up Correction. To 
avoid confusion, throughout the rest of this section we continue to let ROUND (fi) = k refer to 
the pre-insertion value unless otherwise specified. We also assume w.l.o.g. that min {round (£"), 
round(F)} = round(£;). 

Case 4: This case has the following two sub-cases: 

(a) round(S) + 1 < round(£'), round(F).: After insertion, at the beginning of itera- 
tion k + 1^ B has degree 2 and its only neighbors are E and F. Thus, B is the only 
node line contracted into BST(E^ F). 

(b) round(S) + 1 = round(£') < round(F) and type(£') = leaf.: Like (a), at the 
beginning of iteration k + 1^ B has degree 2 and its only neighbors are E (with degree 
1) and F. B is the only node line contracted into BST{E^F)^ with E subsequently 
leaf contracted into LST[F) in the same iteration. 

In both cases, to repair the data structure, we create a new BST{E, F) and insert the edges 
queries (E^B) and (B^F). Then we replace the existing search tree associated with the 
edge query {E,F) with our new BST{E,F). 
Case 5: round(S) + 1 = round(£') and type(£') = line: Before insertion, B was line con- 
tracted into BST{E^ F) at iteration k and E was line contracted into some BST{G^ H) at 
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iteration A: + 1. After insertion, at the beginning of iteration k + 1, E and B are part of a 
chain of degree 2 nodes connecting G and H. 

If ROUND < ROUND (F), then H = F. Thus, B should be hne contracted into 
BST{G,F) together with E. Similarly, if ROUNd(£;) = ROUNd(F) and type(F) = LEAF, 
then H — F and B should again be line contracted into BST{G, F) together with E. Fi- 
nally, if round(£;) = round(F) and type(F) = line, then E, B, and F become part of 
the same chain of degree-2 nodes between G and i7, where parent(£') = parent(F) = 
(G, H). E, B, and F should all be hne contracted into BST{G, H). 

In all cases, to repair the data structure, we remove F) from BST{G, H) and we add 
edge queries {E, B) and {B, F) to BST{G, H), where PARENt(£;) = (G, H) before insertion. 
Case 6: round(£') < round(S) + 1 < round(F): parent(S) = {E, F) implies k < ROUNd(£;), 
with equality implying type(£^) = LEAF. Since ROUND < + 1 we conclude k = 
round(£') and type(£') = leaf. Before insertion, at the beginning of iteration fc, E had 
degree 1 and B was part of a chain of degree 2 nodes connecting E and F. The chain 
was line contracted into BST{E, F) and then E was leaf contracted into LST{F). After 
insertion, B has degree 3, connected to each of E (degree 1), A (degree 1), and F by (pos- 
sibly empty) chains of degree 2 nodes. Thus, E and A should now be leaf contracted into 
LST{B). To repair the data structure, we remove (£", F) from LST(F) and we add (£", B) 
to LST{B). 

In the event that PARENT (F) = NULL and ROUND (F) = A; + 1, we must reconfigure the 
top of the Line-Leaf Tree. We examine LST{F)^ which has not yet been modified by 
the current call to Up Correct. Without loss of generality, fii{F) = k and pi{F) = E. 
If fi2{F) < then Lemma [9] implies fi2(F) = M3(^) = ^ — 1- Thus, at the beginning of 
iteration k, B has degree 3 as described above and all of F, A, and F have degree 1. They 
should be leaf contracted into LST{B), leaving B as the new root and sole node to survive 
until iteration /c + 1. However, if fi2{F) = fc, then Lemma [9] implies fi2{F) = l^s{F) = k. 
Thus, at the beginning of iteration k, B has degree 3 as described above, but F has degree 
at least 3 as well. After another leaf contraction, only B and F remain. For consistency, 
we choose B as the root (replacing F). To repair the data structure, we replace F with B 
as the root node (we decrease round(F) to k). We also insert {B^F) into LST{B). 

Otherwise (parent(F) ^ null or round(F) ^ /c + 1), so we can apply Up Correct 
recursively to determine how B and F interact. Thus, the fate of edge query (B^F) is 
determined by up-correcting B at insertion point F. 

What's left to show is that Up Correct operates in 0{h) time. In all of the non-recursive cases 
(1-5, parts of 6) Up Correct performs at most 0(1) EST or LST operations. Furthermore, all 
calls to Down Correct occur in non-recursive cases, so, by Lemma 14 all the non-recursive cases 
meet the desired bound. Now we address the recursive part of case 6. Consider the path in the 
Line-Leaf Tree from the root of T to B. This path includes a sequence of queries in LST{F) 
down to (F^E), a sequence of queries in BST{E, F) ending at B. We associate 0(1) tokens with 
each edge query on this path and 0(1) tokens with each node appearing in an edge query on this 
path so that the total number of tokens allocated is at most 0{h). We use these tokens to pay for 
the operations collectively performed by all the recursive calls. First, note that because the round 
of the insertion point always increases, we never consider a particular LST or EST more than 
twice on any complete execution of Up Correct. Prior to the recursive call, we insert {A, B) 
and (S, F) into LST{B) which takes 2 tokens away from B since insertion occurs at the head of 
the LST. We remove edge (F, F) from LST{F) which takes 1 token away from each edge query 
in LST{F) leading down to (E^F). Finally, the splitting BST{E^F) takes time proportional to 
the height of BST{E^F)^ so we can pay for this operation using the tokens allocated to the edge 
queries appearing in BST[E^F). Since we only consider each of these data structures twice on 

21 



any complete execution of Up Correct, we never run out of tokens. Thus, we have the desired 
bound. □ 



4.4. Delete. In this section we describe an efficient method to restructure the Line-Leaf Tree 
when a node A is deleted from T. We begin with a definition. 

Definition 16. Let S be a node in the Line-Leaf Tree such that round(S) = fc. B is fragile 
if any of the following hold: 

(1) parent(S) null and = - 1 > /c - 2 = /i2(S) = /i3(S) > l^4{B) or 

(2) type(S) = LEAF and iii{B) = ^2(5) = ^ - 1 > l^siB). 

(3) type(S) = LINE and jiii{B) = k - 1 > ^2(5). 

A fragile node is one that barely adheres to Lemma [9j Fragile nodes play an important role 
in deletion because they are not robust to changes: removing an edge query (A, B) from LST{B) 
where S is a fragile node and ROUNd(74) = l^i{B) (or, in the case that PARENt(S) = NULL and 
> iJi2{B) = /i3(S) > l^4{B), when ROUND (A) = l^s{B)) invalidate the contraction process. 
This motivates the notion of instability: a node B becomes unstable if and only if we remove some 
edge query {A^B) from LST{B), or change the round of some node A such that (A^B) appears 
in LST{B), and Lemma [o] no longer holds. A node is stable if it adheres to Lemma [9j Table [g] 
gives a formal description of the STABILIZE procedure which repairs the contraction process in a 
Line-Leaf Tree T when a single node B becomes unstable. 

Lemma 17. stabilize correctly repairs the contraction process when a single node becomes unstable 
in O (log It;) • OPT time so that T —T' . 

Proof. We begin by proving correctness by structural induction. Suppose B is unstable. 

Case 1: PARENT (^5^ = NULL. (1) Suppose jJii^B) = k- l>k-2 = l^2{B) = M3(^)- Before the 
change, M = pi{B) and B were the only two nodes that survived until the final leaf contraction at 
ROUND k — 1. After deletion, we may have the following two anomalies, which may appear when 
S is a fragile node: 

(a) Suppose /J^i{B) = l^2{B) — 1^?>{B) — k — 2. This happens if M changes. Now B alone survives 
to iteration A: — 1. We set ROUND (5) = l^siB) + 1 = A: — 1 and keep B as the root node. 

(b) Suppose iJii{B) — k — l>k — 2 — iJi2{B) > k — — /J^siB) = lJiA{B). This happens if the 
changed node is P2{B) or p3{B). Call this node N. Now, at the leaf contraction step of iteration 
A: — 2, M is a full node and all its children have degree 1 (including B). Thus, we set ROUNd(S) 
= P3{B) + 1 = A: - 2, ROUND (M) = P2{B) + 1 = A: - 1, and instah M as the new root. To repair 
the data structure, we remove edges {B, N) and (M, B) from LST{B) and we create edge (M, N) 
which we insert into LST{M) (the new root) after we Dow^N Correct (M, B) and (S, N). 

(2) Suppose pi(B) = P2{B) — PsiB) = A: — 1. After deletion, we may have only one anomaly 
which may appear when 5 is a fragile node: pi(B) = P2{B) — k — 1 > k — 2 — ^^{B). This happens 
if the changed node is ps{B). Let P2{B) = N. After deletion, at the beginning of iteration A: — 1, 
M and N have degree 1 and are connected by a chain of degree- 2 nodes containing B. We line 
contract B into BST{M, N) and choose M as root. Thus we set ROUND (S) = PsiB) + 1 = A: - 1, 
ROUNd(M) = P2{B) + 1 = A:. To repair the data structure, we remove edges {B^N) and {M^B) 
from LST{B) and we create edge (M, N) which we insert into LST{M) (the new root) after we 
Dov^N Correct (M,5) and {B,N). 
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Table 6. stabilize : Given a Line-Leaf Tree T with a single unstable node B, 
repair T so that every node is stable. 



stabilize(B) 


Updated Properties 


Data Structure Updates 


Case 1: parent(B) = null, and (1) Ati(B) = A^2(^) = M3(^) = k — 2, or 


(2) i2i(B) = k-l> 122(B) = k-2> 123(B) = /c - 3, or 


(3) 121(B) = 122(B) = k - 


1 >k -2 = 123(B) 


round(B) ^ M3(^) + 1 




if 121(B) = k-l then 


Let pi(B) = M and p2(B) = N 




remove edges (B, M) and (B,N) from LST(B) 


round(M) ^ P'2(B) + 1 


BST(M, B) ^ Down Correct (AT, B) and (B, M) 


PARENT (M) ^ NULL 


create edge (M, AA) from BST(M, N) and insert it into LST(M) 
M becomes new root of the Line-Leaf Tree 


Case 2: type(B) = leaf 


, parent(B) - E, and pi(B) = k-l> k-2 = 122(B) 


round(B) ^ IJ'2(B) + 1 


remove edge (B, E) from LST(E) 
remove edge (M, B) from LST(B) 


Let pi(B) = M and let (B,N) be the edge query representing B in BST(B,E) 


if N ^ E and ROUND (AT) = k then 


TYPE(A^) ^ LEAF 
PARENT (AT) ^ ^ 
PARENT (M) ^ A/" 


remove edge (B, A/") from BST(B,E) which becomes BST(N,E) 
create edge (AT, ^) from BST(N, E) and insert it into LST(E) 

BST(M,N) ^ Down Correct BST(M,B) and BST(B,N) 


create edge (M, AT) from BST(M, N) and insert it into LST(N) 


else 


PARENT (M) ^ E 


BST(M, E) ^ Down Correct BST(M, B) and BST(B, E) 


create edge (M, ^) from BST(M, E) and insert it into LST(E) 


if ^ is unstable then stabilize(£^) 


Case 3: type(B) = line, 


parent(B) = (E,F), and 121(B) = k - 2 




spht BST(E,F) into BST(E,B) and BST(B,F) 


round(B) ^ + 1 


BST(E, F) ^ Down Correct BST(E, B) and BST(B, F) 
attach new BST(E, F) to edge (^, F) 



Case 5; PARENTIS; = Let (B,N) be the edge query representing B in BST{B,E). li N ^ E 
(i.e. BST{B^E) was not empty) and round(A^) = fc, then after deletion, at the beginning of 
iteration A: — 1, node pi{B) = M has degree 1 and is connected to node N through a chain 
of degree-2 nodes containing B. We line contract B into BST{M, N) and leaf contract M into 
LST{N). Subsequently, at the beginning of iteration /c, N has degree 1 and the path N..E is 
a (possibly void) chain of degree 2 nodes. We line contract this path into BST[N^E) and leaf 
contract into LST{E). To repair the data structure, we remove edge (B^N) from BST{B^E) 
which yields BST{N,E). Then, we set round(5) = ^2{B) + 1 = /c - 1 and let BST{M,N) 
be the result of Down Correct BST{M,B) and BST{N,B). We create a new edge (M,iV) 
with BST{M^N) attached and insert it into LST{N). Finally, we create a new edge (N^E) with 
BST{N^E) attached and we insert it into LST{E). Since N effectively replaces B (with the same 
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Table 7. delete: given a node A, remove A from T and repair the contraction 
process so that T —T' 



DELETE (A) 


Updated Properties 


Data Structure Updates 




remove edge (A, B) from Hg/ 
replace A with B nominally 
merge LST{A ^ B) and LST{B) 


if round(B) < round(A) or 
round(B) = round(A) and type(A) = leaf, type(B) = line then 


type(B) ^ type(A) 
round(B) ^ round(A) 
parent(B) ^ parent(A) 


insert LST(B) at location A 


else keep LST(B) at location B 


finally if B is unstable then stabilize B 



ROUND, TYPE, and parent) in LST{E), E cannot become unstable and the correction process is 
complete. 

Otherwise, after the change to LST{B)^ at the beginning of iteration A: — 1, node pi{B) — M 
has degree 1, and is connected to node E through a chain of degree-2 nodes containing B. We line 
contract B into BST{M, E) and leaf contract M into LST{E). To repair the data structure, we set 
round(S) = iJi2{B) + 1 = A; - 1 and let BST{M, E) be the result of Down Correct BST{M, B) 
and BST{B^ E). We create a new edge (M, E) with BST{M^E) attached and replace edge query 
{B,E) in LST(E) with (M,E). Since E has now lost a node of round k (namely B) from its 
LST, it may be unstable. If this is the case, then recursively stabilizing E finishes the correction 
process. 

Case 3: PARENT (^S^ = {E,F). After the change to LST{B), at the beginning of iteration /c - 1, 
node B has degree 2. We know B is line contracted on the path E to F. To repair the data 
structure, we split BST{E, F) into BST{E, B) and BST{B, F) and let the new BST{E, F) be the 
result of Down Correct BST{E,B) and where round(S) = iii{B) + 1 = - 1. 

What's left to show is that we can perform these operations in 0{\ogw) • OPT time. Each 
recursive call to STABILIZE performs at most 0(1) BST operations and at most 1 call to Down 
Correct. The cah to Down Correct always happens with BST{X, Y) and BST{Y, Z) where 
ROUND(y) = - 1, the round of BST{X,Y) is /c - 1 and the round of BST{Y,Z) is at most 
k. Thus, Down Correct will recursive at most once before hitting a base case. This means we 
perform at most 0(1) BST operations for each Down Correct call. Since there are at most 
0{\ogw) recursive calls and each BST operation takes at most 0{OPT) time, we have the desired 
bound. □ 

With STABILIZE in hand, we can formally define delete which appears in Table [7| and prove its 
correctness and time bound. 

Theorem 18. Delete takes 0{\ogw) • OPT time. 

Proof. Let A be the node we wish to delete and let B be its predecessor in Hs- Let S' — S \ {A}. 
In T^/, all the successors of A are successors of B. Thus, as a starting point in deletion, we must 

(1) remove the edge (A, B) from Hs] 

24 



(2) replace A with B in all edges {A, X) in Hs where X B; and 

(3) insert every edge query {A,X) from LST{A) into LST{B) where X ^ B. 

If, before deletion, either (1) round(74) > round(S), or (2) round(74) = round(S) and type{A) = 
LEAF, type(S) = LINE, then B essentially replaces A in the remaining rounds of the contraction 
process so type(S) = type(74), round(S) = round(74), and parent(S) = parent(74). How- 
ever, if ROUND (A) < ROUND (S), then B lasts as long as A in the contraction process so there is no 
need, initially, to update its properties. As with insertion, these steps can be performed in 0(h) 
time. Of course, deleting A may cause B to become unstable. We analyze when this occurs and 
appeal to the stabilize procedure for correctness. 

round(S) < round(74) or round(74) = round(S): and type{A) = leaf and type(S) = 

line: In this case, B was either (a) line contracted between A and some other node M or 
(b) leaf contracted into A. Let ROUND (A) = k before deletion. 

(a) : Suppose B was line contracted before deleting A. We must remove (B^A) from 
BST{M^ A) which, after deletion, becomes BST{M, B). Furthermore, after deletion, 
LST{B) contains all the edge queries from LST{A). Since round(S) < round(74) 
before deletion, B has a higher ROUND after deletion. In fact, B survives exactly as 
long as A did before deletion, effectively replacing A in all iterations of the contraction 
algorithm. Because we replaced A with B in all edges {A,X) in Hs, the contraction 
process has been corrected and we are finished. 

(b) : Suppose that B was leaf contracted into A. Then LST{B) contains all the edge 
queries from LST{A) except for {A^B). If A was a fringe node before deletion and 
ROUND (S) — k — 1 then B will not be a full node at iteration k — 1. Instead, it will 
have degree 2 which violates Lemma [9} In this case, we must continue to correct the 
contraction process which we do through the PATCH procedure. Otherwise, Lemma [9] 
holds and the contraction process is repaired. 

round(S) > round(74) or round(S) = round(74) and type(74) = line:: Here, B absorbs 
A in the search tree. B was either (a) line contracted together with A into some BST, 
(b) the parent of A, or (c) either £^ or F in the event that PARENt(74) = (E, F). 

(a) Suppose B was line contracted together with A. Then B holds all nodes formerly in 
either LST{A) or LST{B). By Lemma |) after deletion fii{B) = fi2(B) = k-l, and so B 
is line contracted as before. The contraction process doesn't change any further. 

(b) Suppose B was parent (A). There is a possibility that before insertion ROUND (A) = 
k — 1 and B was a fringe node. If this is the case, then B will not be a full node at iteration 
k — 1; instead, it will have degree 2. We address this situation in Anti-Up Correction. 
Otherwise, Lemma [9] holds for B and B effectively absorbs A. 

(c) Suppose parent(74) was (B^F) w.l.o.g. Then we simply add nodes to LST{B) of 
ROUND smaller than ROUNd(S), without removing any others. The contraction process 
does not change any further. 

To summarize, if B was contracted before A, then B replaces A\ we put the merged LST{B) 
in the position formerly occupied by A and B inherits A's attributes: TYPE, ROUND, and PARENT. 
Otherwise, B absorbs A and we put the merged LST{B) in the position formerly occupied by S, 
while B keeps its own attributes (see Table [t] for a succinct description of the algorithm). 

To prove the bound on the running time, we observe that delete makes at most 0(1) LST 
operations, each of which is at most 0{h). Thus, the call to STABILIZE dominates the running time 
of DELETE. Therefore, by Lemma [T7| we have the desired bound. □ 

Deletion is the only operation for which we do not have an 0{h) bound on the running time. 
Here is the problem: suppose we want to delete A and the path in T runs only through LSTs. 
If we recursively need to call STABILIZE on nodes appearing these LSTs, then each call to Down 
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(a) (b) 

Figure 7. Results comparing the height of the Line-Leaf Tree to the optimal 
static search search tree on (a) random tree-like partial orders; and (b) a large 
portion of the UNIX filesystem. The non-shaded areas show the average height of 
both the Line-Leaf Tree and optimal static algorithm. The shaded area shows 
their ratio (as well as the min and max values over the 1000 iterations). 

Correct may operate on a BSTs which have no ancestor / descendent relationship in the tree — 
with insertion, this never happens because any time we manipulate a BST, it's on path from the 
root down to the predecessor of the node we wish to insert. 

5. Empirical Results 

Here we show the results of two experiments which compare the height of a Line- Leaf Tree 
with the height of an optimal static search tree for a tree-like set S. For these experiments, we 
consider the height of a search tree to be the maximum number of edge queries performed on any 
root-to-leaf path. So any dynamic edge query in a Line-Leaf Tree counts as two edge queries in 
our experiments. 

In the first experiment, we examine tree-like partial orders of increasing size n. For each n, we 
independently sample 1000 partial-orders uniformly at random from all tree-like partial orders with 
n nodesj^ The non-shaded area of Figure [t] (a) shows the heights of the Line-Leaf Tree and the 
optimal static tree averaged over the samples. The important thing to note is that both appear 
to grow linearly in logn. We suspect that the differing slopes come mainly from the overheard 
of dynamic edge queries, and we conjecture that the Line- Leaf Tree performs within a small 
constant factor of OPT with high probability in the uniform tree-like model. The shaded area of 
Figure [t] (a) shows the average, minimum, and maximum approximation ratio over the samples. 

Although the first experiment shows that the Line-Leaf Tree is competitive with the optimal 
static tree on average tree-like partial orders, it may be that, in practice, tree-like partial orders 
are distributed non- uniformly. Thus, for our second experiment, we took the /usr directory of an 
Ubuntu 10.04 Linux distribution as our universe U and independently sampled 1000 sets of size 
n = 100, n = 1000, and n = 10000 from U respectively. The /usr directory contains 23,328 nodes, 

■'^In keeping with the uniform model for general partial orders defined in we assume V{n) is the set of all 
rooted, labeled, oriented trees on 1, . . . ,n such that every root-to-leaf path has labels that increase. The set V{n) is 
in one-to-one correspondence with the set of increasing trees (these tree are also known as heap- ordered and recursive 
trees) The expected worst-case and average height of a random increasing tree is G(logn) [121414] . This is in 

contrast to general partial orders which, on average, have height 3. 
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of which 17,340 are leaves. The largest directory is /usr/share/doc which contains 1551 files. The 
height of /usr is 12. We believe that this directory is somewhat representative of the use cases 
found in our motivation. As with our first experiment, the shaded area in Figure [t] (b) shows the 
ratio of the height of the Line-Leaf Tree to the height of the optimal static search tree, averaged 
over all 1000 samples for each sample size. The non-shaded area shows the actual heights averaged 
over the samples. The Line-Leaf Tree is again very competitive with the optimal static search 
tree, performing at most a small constant factor more queries than the optimal search tree. 

Acknowledgements. We would like to thank T. Andrew Lorenzen for his help in running the 
experiments discussed in Section [5j 
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Appendix A. Figures for Insertion and Deletion 

Here we provide figures describing each case of the Insert, Up Correction, Down Correction, 
Stabihze, Transition, and Delete procedures described in the main text (Table [§. 
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Table 8. Index of figures describing each case of the dynamic operations on the 
Line-Leaf Tree. 
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Figure 9. Up Correction, Case 2. 
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Figure 12. Up Correction, Case 5. 
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Figure 13. Up Correction, Case 6. 
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Figure 14. Down Correction, Case 1. 
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Figure 17. Down Correction, Case 4. 
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Figure 19. Transition, Part 1. 
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Figure 26. Insert, Case 5. 
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BEFORE DELETION 
Hl(B) = H2(B) = |i3(B)=k-1 
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Figure 28. Stabilize, Case 2. 
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Figure 29. Stabilize, Case 3. 
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Figure 30. Delete. 
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